Feature extraction based on zero-crossings with peak amplitudes for robust speech recognition in noisy environments
نویسندگان
چکیده
The Ensemble Interval Histogram (EIH) is an auditory model which can be used as a robust \front-end" for speech recognition systems. The utilization of multiple level-crossing detectors in the EIH provides frequency and intensity information, which may be useful for speech processing. Proper determination of the number of levels and the level values is very important for reliable performance of the system. In this paper, an analytic relationship is developed for variance and SNR of the level-crossing intervals as a function of the crossing level value, and a new feature extraction method based on zero-crossings with peak amplitudes is proposed for robust speech recognition in noisy environments. The proposed method not only can preserve intensity information, but also is robust to noise in estimating frequency information without the eeorts to determine the level values and the number of levels. Experimental results show the robustness of the proposed method.
منابع مشابه
Voice Command II: A DSP Implementation of Robust Speech Recognition in Real-World Noisy Environments
The \Voice Command" system, designed for isolated word recognition tasks in real-world noisy environments, was implemented on a xed-point DSP board to operate in real-time. Simple auditory model, i.e., zero-crossings with peak amplitudes (ZCPA) model, is used for noise-robust feature extraction , and neural network classiier recognizes input patterns. The system performance is further improved ...
متن کاملComparative evaluations of several front-ends for robust speech recognition
SPEECH RECOGNITION Doh-Suk Kimy, Jae-Hoon Jeongy, Soo-Young Leey, Rhee M. Kilz yDepartment of Electrical Engineering/ zDivision of Basic Science Korea Advanced Institute of Science and Technology 373-1 Kusong-dong, Yusong-gu, Taejon 305-701, Korea E-mail: [email protected] ABSTRACT Zero-crossings with peak amplitudes (ZCPA) model motivated by human auditory periphery is simple compared wi...
متن کاملZero Crossings with Peak Amplitudes and Perceptual Features for Robust Speech Recognition
It is known that certain properties of human speech perception are invariant or less affected by additive and reverberant noise. In this paper the zero crossings with peak amplitudes (ZCPA) model is evaluated for speech recognition with several perceptual properties of human hearing. Experimental results indicate that under white Gaussian noise, the maximum performance benefit is obtained by th...
متن کاملRobust speech recognition using features based on zero crossings with peak amplitudes
This paper presents an extensive study of zero crossings with peak amplitudes (ZCPA) features, that have earlier been shown to outperform both conventional and auditory-based features in presence of additive noise. The study starts by optimizing different parameters involved in ZCPA feature computation, followed by a comparison of ZCPA and MFCC features on two recognition tasks in different bac...
متن کاملAn Enhanced Feature Extraction Method for the Zero-Crossings with Peak Amplitudes Auditory Model Based on the Mean Discharge Rate
The zero-crossing with peak amplitudes (ZCPA) is an auditory model used in cochlear implant sound processing. However, when used as an ASR front-end, ZCPA performs better than the MFCC features in noisy conditions, but degrades in clean condition. This is because ZCPA is based on the dominant frequency principle, which emphasizes low frequencies but deemphasizes the fine temporal structures of ...
متن کامل